Spectral Study of the Vocal Tract in Vowel Synthesis: A Comparison between 1D and 3D Acoustic Analysis

نویسندگان

  • Negar M. Harandi
  • Daniel Aalto
  • Antti Hannukainen
  • Jarmo Malinen
  • Sidney S. Fels
چکیده

A state-of-the-art 1D acoustic synthesizer has been previously developed, and coupled to speaker-specific biomechanical models of oropharynx in ArtiSynth. As expected, the formant frequencies of the synthesized vowel sounds were shown to be different from those of the recorded audio. Such discrepancy was hypothesized to be due to the simplified geometry of the vocal tract model as well as the one dimensional implementation of Navier-Stokes equations. In this paper, we calculate Helmholtz resonances of our vocal tract geometries using 3D finite element method (FEM), and compare them with the formant frequencies obtained from the 1D method and audio. We hope such comparison helps with clarifying the limitations of our current models and/or speech synthesizer.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Semi-Polar Grid Strategy for the Three-Dimensional Finite Element Simulation of Vowel-Vowel Sequences

Three-dimensional computational acoustic models need very detailed 3D vocal tract geometries to generate high quality sounds. Static geometries can be obtained from Magnetic Resonance Imaging (MRI), but it is not currently possible to capture dynamic MRI-based geometries with sufficient spatial and time resolution. One possible solution consists in interpolating between static geometries, but t...

متن کامل

Comparison of vocal tract transfer functions calculated using one-dimensional and three-dimensional acoustic simulation methods

Acoustic characteristics of the vocal tract have been investigated extensively in the literature using a onedimensional (1D) acoustic simulation method. Because the 1D method assumes plane wave propagation only, it is recognized to be valid only in the low frequency region (below about 4 or 5 kHz). Recently, a three-dimensional (3D) acoustic simulation method was developed, to obtain more preci...

متن کامل

Synthesis of VV Utterances from Muscle Activation to Sound with a 3D Model

We propose a method to automatically generate deformable 3D vocal tract geometries from the surrounding structures in a biomechanical model. This allows us to couple 3D biomechanics and acoustics simulations. The basis of the simulations is muscle activation trajectories in the biomechanical model, which move the articulators to the desired articulatory positions. The muscle activation trajecto...

متن کامل

بررسی ساختار سازه‌ای واکه‌های زبان فارسی در بزرگ‌سالان دوزبانه آذری فارسی

Objective: Vowels are the center of syllables while formant structures are one of the most important acoustic characteristics of speech sounds that help in their articulatory and perceptual aspects. Formants represent the shape and size of the vocal tract. There exist trivial differences between the vocal tracts of different people due to which the formant structures of a vowel in one person ar...

متن کامل

Acoustic Analysis of Persian EFL Learners' Pronunciation of English Vowels

This paper reports the results of an experimental study on non-native production of English vowels. Two groups of Persian EFL learners varying in language proficiency were tested on their ability to produce the nine plain vowels of American English. Vowel production accuracy was assessed by means of acoustic measurements. Ladefoged and Maddison’s (1996) F1 F2 measurements for American English v...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1512.05811  شماره 

صفحات  -

تاریخ انتشار 2015